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023403ep JH/BM/ml 

Targeted transgenesis using the Rosa26 locus 

Introduction 

The invention provides a method for targeted transgenesis using the Rosa26 locus 
Suitable nucleotide acid sequences and vectors for the targeted transgenesis are 
provided. The Rosa26 locus proved to be a suitable integration site allowing strong and 
predictable expression of inserted transgenes carrying exogenous promoters. 

Background of the Invention 

The generation of transgenic mice by nuclear injection of purified DMA into fertilized 
eggs is a widely used approach for studying gene or promoter function In vivo 
However, the level and pattern of expression often varies strongly depending on copy 
number, configuration, and integration site of the transgene. In addition, founder mice 
occasionally do not transmit the transgene. Thus, a number of different founders need 
to be generated and tested in order to identify a useful strain, which is a laborious and 
time-consuming undertaking (Bradley et. al., Nature Genet. 14:121-123 (1996); Jasin 
et a«., Proc. Natl. Acad. Scl. USA, 93:8804-8808 (1996); Dobie et al., Trends Genet 
13:127-130 (1997); Garrick et al., Nature Genet., 18:56-59 (1998), Al-Shawi et a..'! 
Mol. Cell. Boil. 10:1192-1198 (1990)). 

To overcome these limitations, homologous recombination In embryonic stem cells has 
been used to produce mice carrying a single copy of the transgene integrated into a 
predetermined site of the genome (Shaw-White et al., Transgenic Res.; (i).i-i 3 
(1993); eronson et al., Proc. Natl. Acad. Sci. USA, 93(17:9067-72 (1996); Hatada et 
al., J. Biol., Chem., 274(2):948-55 (1999); Vivian et al., Biotechniques, 27(l):i 54 -62 
(1999); Evans et al., Physiol. Genomics, Mar. 13, 2(2):67-75 (2000); Cvetkovic etal 
J- boil. Chem., 275(2): 1073-8 (2000); Guiiiot et al., Physiol. Genomics, Mar 13 
(2):77-83 (2000); Magness et al., Blood, 95(11): 3568-77 (2000); Misra etal., BMC 
B.otechnol., i(i):i2 (2001); Minami et al., Blood, 100(12) .-4019-25 (2002)- Tang et 
al. Genesis, 32(3):199-202 (2002)). In these studies, the ubiquitous Hprt locus was 
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more or less successfully used for 'targeted transgenesls'. Insertion of a lacZ gene 
under the control of the polyoma enhancer/HSV thymidine kinase promoter into the 
third exon of Hprt resulted in variable 3-galactosldase expression that was both 
orientation and cell-type dependent (Shaw-White et al., Transgenic Res.; (1):1-13 
(1993)). Although transgenes under the control of the human and the chicken Q-actin 
gene promoter resulted in widespread expression when inserted into the Hprt locus, 
the level of transcripts varied strongly in different tissues (Bronson et al., Proc. Natl. 
Acad. Sci. USA, 93(17:9067-72 (1996)). Unexpectedly, expression of these 
transgenes, but not of the endogenous Hprt gene appeared to be low or undetectable 
in kidney and liver (Bronson et al., Proc. Natl. Acad. Scl. USA, 93(17:9067-72 (1996)). 
" Hatada et al. demonstrated that the HPRT locus suppresses the activity of both, the 
haptoglobin gene promoter as well as the herpes simplex thymidine kinase promoter in 
several tissues of mice (Hatada et al., J. Biol., Chem., 274(2):948-55 (1999)). 
Likewise, a human eNOS promoter-LacZ reporter gene placed in the Hprt locus was 
found to be Inactive in hepatic vessels that otherwise express the endogenous eNOS 
gene (Guillot et al., Physiol. Genomics, Mar. 13, (2):77-83 (2000). Finally, since the 
HPRT gene is on the X chromosome, transgene expression at this locus is subjected to 
random X-inactivation. The expression of the transgene in ail ceils of the female, 
therefore, requires the generation of homozygotes. 

To avoid the 4 complications referred to above, it would be desirable to define an 
autosomal locus that allows strong and predictable expression of transgenes inserted 
through homologous recombination- It is, however, not predictable for a person skilled 
in the art whether chromosomal loci which fulfill these criteria are available at all. 
Exogenous transgenes may not harbor all of the sequences necessary and sufficient 
for proper regulation of transcription and may therefore be influenced by cis-regulatory 
elements near the site of Insertion. 

The rosa26 locus had been identified by random insertion of retroviral sequences and a 
B-galactosidase-neomycin resistance fusion gene into the genome of mouse embryonic 
stem cells (Zambrowicz et al„ Proc. Natl. Acad. Sci. USA, 94, 3789-94 (1997)). The 
rosa26 promoter appeared to mediate ubiquitous expression of promoter-less genes 
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both in embryos and adult mice (Kisseberth at al.. Dev. Biol., 214:128-138 (199m. 
Zambrovvic, et a.., Proc. Mad. Acad. SC. USA, 94, 3789-94 (1997,,, a,ba,tat differe* 
levels in Afferent organs (voo.* et al., EM BO reports, 21:292-297 (2001)). However 
a systematic comparison with other ubiquitous promoters to determine the strength or 
rosa26 promoter had not been performed. In addition, the activity of exogenous 
promoters inserted Into the rosa26 locus has never been examined. 

Summary of the Invention 

The present Invention is based on the finding that a particular chromosomal locus 
present wrth.n the eukaryotic genome (including that of mammaiian ES ce,.s), namely 

inTI; r' 0 ?*' PreSerVati0n ° f the * *«™V of heteroiogous promoters 

mserted through homologous recombinadon at that iocus. This chromosome, iocus Is 
therefore useful in the context of the "targeted transgenesis" approach for the efficient 
generate of transgenic organisms (such as mice) with a predictabie transgene 
expression pattern. ="=.a=ne 

Such a "targeted transgenesis" method comprises conserve experfmental steps A 
gene expression cassette comprising a suitable promoter (e.g. a ubiquitous or tissue 
specific promoter, either inducible or constitutive) functional^ .inked to a gene of 
interest haa to be created; subsequently a vector for the targeted insert,on of the 
above menboned gene expression cassette into the Rosa26 locus has to be generated- 
the insertion of the above mentioned gene expression cassette Into the Rosa26 locus 
through homologous recombination in embryonic stem cells follows; Anally transgenic 

W«tocy'L 9enerated ^ * e lnJeCa ° n °' SUCh 9enetlcall >' mod, " ed cells into 
More specifically present invention provides 

CD a method for generating eukaryotic cells having a modified Rosa26 .ocus, which 
method comprises the following step (hereinafter shordy referred to as step (a))- 
introducing a functions DNA sequence into the Rosa26 iocua of starting eukaryotic 
cells by homologous recombination which with a targeting vector comprising sa.d 
mncdonal DNA sequence flanked by DNA sequences homologous to the Rosa26 .ocus- 
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(2) the method of (1) above, wherein the eukaryotic cells are mammalian embryonic 
stem (ES) cells, preferably are non-human mammalian ES cells; 

(3) a targeting vector as defined in (1) above; 

(4) eukaryotic cells having a modified Rosa26 locus obtainable by the method of (1) 
and (2) above; 

(5) a method for preparing a transgenenlc multi-cell organism having a modified 
Rosa26 locus which comprises utilizing the method as defined in (1) and (2) above; 

(6) the method of (5) above, wherein the transgenenlc multi-cell organism is a non- 
human mammal and said method comprises modifying an ES cell as defined in (2) 
above; 

(7) a transgenic multi-cell organism and non-human mammal obtainable by the above 
defined methods (5) and (6), respectively; and 

(8) the use of the eukaryotic cell of (4) above, the transgenic multi-cell organism of 
(7) above, or the transgenic non-human mammal of (7) above for gene function 
studies, drug development, as disease model, etc. 

The method of the invention offers several advantages over the current technology of 
pronuclear injection. In particular, the targeting vector allows Insertion of a single copy 
of a gene expression cassette, thus avoiding modulation of transgene expression by 
the arrangement of multiple copies. By choosing the autosomal Rosa26 locus as 
insertion site, the expression pattern of the inserted transgene in the non-human 
animal is predictable; random X-in activation and/or modulation by chromosomal 
position effects are avoided. This also eliminates the need to generate and analyse 
multiple transgenic strains for any given transgene. Finally, the Rosa26 targeting 
vector for the site-specific integration can be used for multiple gene expression 
cassettes. 

Description of the Figures 

Houre 1: Targeted insertion of CreER and CAGGS~Cre-ER into the Rosa26 locus. A 
cassette comprising a Cre-ER operationally linked to a CAGGS promoter or a cassette 
comprising a splice acceptor site (SA) linked to a Cre-ER are Inserted into the Rosa2€ 
locus via homologous recombination. A perpendicular dash marks the insertion point 
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within the Rosa26 locus and the rectangular boxes delinate the starting and end points 
of the Rosa26 transcript. 

fjgureZ: Southern Blot analysis of the Inducible recombination of the. Rosa (reporter) 
(A) Genomic DIVA was Isolated from liver (Li) spleen (Sp) and small intestine (Si) of 
transgenic mice carrying the SA-creER/Rosa-rep insert or the CAGGS-creER/Rosa-rep 
Insert. To induce the Cre-ER recombinase the mice were treated with Tamoxifen 
(treated). As a control, a group of mice with the SA-creER/Rosa-rep insert was left 
untreated (untreated). Presence of the reporter band (floxed) and deletion (deleted) of 
it upon an induced recombination event are indicated. (6) Transgenic mice carrying at 
one Rosa26 locus a loxP flanked DNA polymerase p gene segment (por*-) and at the 
other a SA-creER/Rosa-rep were treated with Tamoxifen (treated). A control group of 
mice was left untreated (untreated). Genomic DNA from liver (U) f spleen (Sp), kidney 
(KI). heart (He), lung (Lu), thymus (Th), muscle (Mu), small intestine (Si) and brain 
(Br) was analysed for presence of pol*". In a non-recombination event the por* 7 "* 
band remained (floxed), in a recombination event deletion occurred (deleted). (C) As 
(B), but mice carried instead of the SA-creER/Rosa-rep the CAGGS-creER/Rosa-rep 
insert. 

BsUHLll Western Blot analysis of recombinase and a-actln expression. Proteins were 
extracted from rosa(SA-CreER TZ ) and rosa (CAGGS-CreER 72 ) mice and analyzed as 
described In the "Materials and Method" section. The positions of bands representing 
CreER" and actln are indicated. FA: fat tissue, Ty: Thymus; Sp: spleen, Br: Brain, L U : 
lung, He: heart. 

Detailed Descripion of the Invention 

The term "living organisms" according to the present invention relates to multi-cell 
organisms which can be vertebrates such as mammals (e.g. non-human animals such 
as rodents including mice and rats; and humans) or non-mammals (e.g. fish) or can 
be invertebrates such as insects or worms, or can be plants (higher plants, algi or 
fungi). Most preferred living organisms are mice and fish. 

"Eukaryotic cells" and "starting eukaryotic cells" according to the present invention 
include cells isolated (derived) from the above defined living organisms and cultured in 
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vitro. These cells can be transformed (immortalized) or untransformed (directl 1 
derived from living organisms; primary cell culture). The term "eukaryotic cells" als< 
includes mono-cellular eukaryotic cells such as yeasts, etc. 

It is preferred in the method (1) of the present invention that the eukaryotic cells ar 
derived from a multi-cell organism including vertebrates, invertebrates and plants 
preferably is a vertebrate cell, more preferably is derived from a mammal, includin< 
rodents such as mouse, rat, etc., or a fish such as zebrafish. 

In the methods (1) and (2) of the invention it is preferred that the functional DIM/ 
sequence comprises a gene encoding a protein/peptide of interest (i.e. is a expressible 
and translatable DNA sequence), more preferably said functional DNA sequence is < 
gene expression cassette (a) comprising a gene of interest operatively linked to i 
promoter, or (b) is a DNA sequence which can be converted into such gene expressior 
cassette (i.e. into an operatively linked "promoter-gene of Interest" construct, e.g. b) 
subsequent modification reactions after its integration). The gene of interest within the 
gene expression cassette can be any gene coding for a certain protein/peptide o 
interest, including, but not limited to, recombinases, reporter genes, receptors 
signaling molecules, transcription factors, pharmaceutical^/ active proteins anc 
peptides, drug target candidates, disease causing gene products, toxins, etc. 

The promoter of the gene expression cassette preferably is a ubiquitous or tissu« 
specific promoter, either constitutive or inducible. Preferred ubiquitous promoter is th< 
CAGGS-promoters are CAGGS, hCMV, PGK; preferred tissue specific promoters an 
FABP (Saam & Gordon, J. Biol. Chem., 274:38071-38082 (1999)), Lck (Orban et at. 
Proc. Natl. Acad. Sci. USA, 89:6861-5 (1992)), CamKII (Tsien et al., Cell 87: 1317 
1326 (1996)), CD19 (Rickert et al., Nucleic Acids Res. 25:1317-1318 (1997)), Keratir 
(U et al., Development, 128:675-88 (201)), Albumin (Postic & Magnuson, Genesis 
26:149-150 (2000)), aP2 (Barlow et al.. Nucleic Acids Res., 25 (1997)), Insulin (Ray e 
al., Int. i. Pancreatol. 25:157-63 (1999)), MCK (Bruning et al., Molecular. Cell 2:559 
569 (1998)), MyHC (Agak et al., J. Clin. Invest., 100:169-179 (1997), WAP (Utomo e 
al., Nat. Biotechnol. 17:1091-1096 (1999)), Col2A (Ovchinnikov et ah, Genesis 1 
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26.145-146 (2000)); prefaced inducible 7 promoter systes are- Mx fKOhn - , 
SCnence, 269:1427-1429 ,1995)), tet (Ur,.n g er et „., Proc. N d Ica. Sc, Usl 
97=7963-8 (2000),. Trex (Fe, 9 and Erikso „, Human Gene Therapy. U^) 

0? PPeferred ** *• ° NA SeqUenCeS h0moto «"> us •» *• *osa26 tocus are 

'"" (I ° XP ' etc "> - A signals (such as synthatic polyadenylation 
s,tes, or the poiyadenyladon site of human growth hormones, etc.), spile or 
sequences (such. as a splice acceptor of adenov.rus etc ) In rons Z r 
detecdon, enhancers, se.ec«on markers, etc. ' °* ^ Pr ° te,n 

-m the , and y nJ^T7T^ ££7" " 
home DNA sequences hav,n g the sequin shlnl ^'^1^ 
respectively, and the promoter is a rArr-<= 5 ' 
vector has th. sequence ' ^ 

oTmrntheiT^^zrr (bes,des step (a) — *~» — - 

subsequendyprocessadaccord^oneormoreofthefoHoWnqt; ^ 
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(d) the ES cells obtained in steps (b) or (c) are injected into blastocysts; and/or 

(e) transgenic non-human animals carrying one or more functional genes of interest at 
the Rosa26 locus are generated (viz. by well known breeding procedures). 

The transgenic multi-cell organisms and non-human mammals obtainable by the 
method (5) and (6), respectively; preferably have an operatively functional gene 
expression cassette (as defined above) integrated into its Rosa26 locus. Such 
transgenic multi-cell organisms and non-human mammals are suitable for gene 
function studies, drug development, as disease model animals, etc. 

The invention is further explained by the following examples and the attached figures, 
which are, however not to be construed so as to limit the invention. 

Examples 

Materials and Methods 

Plasmid constrMflion: 

Rosa-targeting vector: A 129 SV/EV-BAC library (Incyte Genomics) was screened with 
a probe against exon2 of the Rosa26 locus (amplified from mouse genomic DNA using 
Rscreenls (GACAGGACAGTGCTTGTTTAAGG) (SEQ ID NO;l) and Rscreenlas 
(TGACTACACAATATTGCTCGCAC) (SEQ ID NO:2)). Out of the identified BACdone a 11 
kb EcoRV subfragment was inserted into the Hindll site of pBS. Two fragments (a 1 kb 
SacII/Xbal- and a 4 kb Xbal-fragment) were used as homology arms and inserted into 
a vector consisting of a FRT-flanked neomycin resistance gene (unpublished) and a 
splice acceptor site from adenovirus (Friedrich G., Soriano P., Genes Dev., 5:1513-23 
(1991)). The CAGGS-promoter or the splice acceptor (SA) and a synthetic intron were 
inserted between the 5' arm and the neomycin resistance gene. The CreERT 2 and a 
polyadenylaOon site (pA) were cloned 3' of the SA and CAGGS-promoter. 

Cell culture: Culture and targeted mutagenesis of ES cells were carried out as 
previously described (Hogan et al., (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor NY.)/ pp. 253-289.) with ES celt lines derived from both inbred and Fl 
embryos. 
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fifai All mice were kept in the animal facility at Artemis Pharmaceuticals GmhH • 
microlsolator cages (Tecniplast Sealsave) B6D2F1 Mf c . for «T 

^g^f^ iU ^ S " ^m to.L^mBtemeDt^ The production of mice 
by tetraplold embryo complementation was essentially performed as descrZ Z 
et al., ProcNMAce* Sc/ USA, 98, 6209-6214.). ( 

Mqand arlmmwrarinn i 100 mg Tamoxlfen-free base (Sigma TS64m 
suspended m xoo M , Eth ano> end solved ,n 1 m, sunflower o-Ts Z 1 To 
mg/ioo p, tamoxifen so,utlon was sonicate, ■* !- 2 mlnutes ^ £££ a 

_ , _ eeK 0,0 m,ce °y a feeding needle (FST Fine Science 

Tools GmbH, 18061-20). science 

riT. h '° r ' Pr "^ '- : WeStem Wot —»* w « Pen-omned using SDS-PAGE 
(NuPAGE, mvitrogen) and the Breeze Immunodeteebon System (Inv JI) aclrdl 
to ^manufacturer protoco,, Immunodetection w as done I ZZ£ * m«o 
S a C- B,ot 08y . tnc0 aga|nst ^ pRB xo6c » « « e (H 20, 

inir ^ rabWt P ° lyd0na ' I9G ^ C - c 

Example 1 

A CreER" gene (Fe.l et al., (1997) s/ocnem a,op nys «es Commun. 237 752 7S7 , 

* ST«T "I nSerted ,nt ° rMa26 ' 0CUS by h "~ ta »~ <~na«on ,n b 
cells (Fig. 1} . In addlUon/ ^ Crea , n ganfi was S 

sequence (Friedncn and soaano (1991), Genes Dev „ 9 , JlZZZ 
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the endogenous activity of the rosa26 gene promoter (Fig. 1). A /oxP-flankec 
hygromycin resistance gene was introduced into the second allele of rosa26 to provide 
test substrate for Cre ER 72 (Seibler et al., (2003), in press). ES cells modified at botr 
rosa26 alleles were injected Into tetraploid blastocysts and completely ES cell derivec 
mice were generated (Eggan et al., (2001). PNAS, 98, 6209-6214). Rosa(SA- 
CreER^/reporter) and RosaCCAGGS-CreER^/reporter) mice were fed with daily 5 mc 
Tamoxifen for 5 days and recombination of the reporter was analyzed 3 days after the 
last administration. Southern analysis of genomic DNA from different organs showec 
up to 50% recombination in the RosaCSA-CreERP/reporter) mice and up to 90°/< 
recombination in the rosaCCAGGS-CreERF/reporter) mice, respectively (Fig. 2A). As 
the second substrate, we used the loxP flanked DNA polymerase B. gene segment 
(po/5 flox ) (Gu et at., (1994). Science, 265, 103-106). The polB^/rosaCSA-CreER 72 ) anc 
po/^yrosafCAGGS-Cre^^mice were fed with 5 mg tamoxifen per day for 5 days and 
analyzed 3 days later. Southern blot analysis revealed that the /oxP-flanked 
polymerase 6 gene segment was excised in more than 90% of cells In all organs 
except brain in the rosaCSA-CreERp/reporter) mice (Fig. 2B). In contrast, the degree 
of inducible recombination was significantly higher In rosaCCAGGS-CreER^/reporter) 
mice,. reaching 100% efficiency in most organs and up to 70% in brain. 

To investigate the pattern and level of CreER 72 expression in rosaCSA-CreER 72 ) anc 
rosa(CAGGS- CreER 72 ) mice, we performed Western analysis using antibodies specific 
for Cre. The 74 kDa band corresponding to the CreER 1 " 2 fusion protein was detectable 
in all organs of rosa(CAGGS- CreER 72 ) mice, including brain (Fig. 3). In contrast, the 
CreER 12 expression level In rosaCSA-CreER 72 mice was significantly lower compared tc 
the rosa^GGS-CreER 72 ) strain and appeared to be undetectable in brain (Fig. 3). 
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SEQUENCE LISTING 

<llo> artemis Pharmaceutical GmbH 

<12 0> Targeted transgeneais using the Rosa26 locus 

<130> 023403ep JH/BM 

<140> 
<141> 



<160> 7 



<170> Patenttin ver. 2.1 

<2lo> l 
<21l> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> SaSreeSJs 011 ° f Artificial sequence, primer 

<400> 1 

gacaggacag tgcttgttta agg 
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<21Q> 2 
<2ll> 23 
<212> DNA 

<2X3> Artificial Sequence 

<22Q> 

<2 ' 3> ° f ^ tificlal sequence, primer 

<400> 2 

tgactacaca atattgctcg cac 
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<210> 3 

<21i> 1313S 

<212> DNA 

<213> Mua musculua 

<220> 

<223> Description: Rosa26 locus 
<400» 3 

aagcttctca cgtagcaacc agagctccag 
actcctgtga tccaacacag gagcaacctt 
cctttttttg gggggggggg gggaacaagt 
tttagaaagg aacaatccta aggtcacttt 
aaataccaaa ttccaaatat ccgttttaat 
ttttagtttt aaaaaggtca attttccaaa 
aggaacaaga gccttgggtc atgtcctacc 
caatctgaat aaocttaaag cctctaatcc 
ctaatgatcc tcctgagaaa actggaagaa 
aaactcagaa atctctaatt atgtcagtat 
acatctaaaa taacctctta aagttttcct 



agccagcagc tgctgccgcc tfcgtatactc 60 
ttctttaccc cacccccact tcttaacaca 120 
gctccatgct ggaaggaetg gaactatgcc leo 
taaattgagg tctttgattt gaaaatcaac 240 
taaaccagca atgtggatat aagcattaag 300 
cattcagcaa tcatatttaa atttacagct 3S0 
aaagaacata actcaacatt ctacacatga 420 
cataacaggc cacaaatttt ggacagagaa 490 
atccagggaa aagaaattcc tgtgtcctcc S40 
tctctgcfctt agtcctaggt cagattgcac 600 
cctagcgacc taaaccatta ntaatatcaa Geo 
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attaaccatc aaaacacttt cctctcaaca tgctgcacac aaacctcctc ctggaacctc 720 
ctccatctgg atcctcccca atcaaaagta taggtattta acatataagc aaggaagtaa 780 
tgtaaacacg accttggtca caaatatgtc atctaaaaac aatttagtca aggtatggag 840 
gaaattcgag aacctgaatc tttttaagta ttttgagcac aggaacaatt ggcaaaagga 900 
atccaggtat agacaaaacc cagagcccag agctctgggc gaaaaatgag ttgctggtga 960 
agacgttaca caagtaacat gagaaagcag aaaatgcagg tcatccacgc acccctgacc 1020 
caggccagca gggcgggctg cagcatcagt acacaggagfa aagatcctta tccctaagaa 1080 
tgagaaaggc aaaggcgccc gatagaataa attagcatag aaggggcttt cccaggagtt 1140 
aaaaccttcc ttctgagcga ttacctacta aaaccagggc ttttgcccac taccatttac 1200 
ctaggatctt ggcttgcacg gattcatagg ggcatatccc tccccctctt ctttagagtc 1260 
gttcttaaaa gatcgctctc caogccctag gcagggaaaa cgacaaaatc tggctcaatt 1320 
ccaggctaga accctacaaa ttcaacaggg atatcgcaag gafcaccgggg catacgccac 1380 
agggagtcca agaatgcgag gtgggggtgg cgaaggtaat gtctttggtg tgggaaaagc 1440 
agcagccatc tgagatagga actggaaaac cagaggagag gcgttcagga agattatgga 1500 
ggggaggact gggcccccac gagcgaccag agttgecaca aggccgcaag aacaggggag 1560 
gtggggggct cagggacaga aaaaaaagta tgtgtatttt gagagcaggg etgggaggcc 1620 
tctcctgaaa agggtataaa cgtggagtag gcaataccca ggcaaaaagg ggagaccaga 1680 
gtagggggag gggaagagtc ctgacceagg gaagacatta aaaaggtagt ggggtcgact 1740 
agatgaagga gagcctttct ctctgggcaa gagcggtgca acggtgtgta aaggtagcfcg 1800 
agaagacgaa aagggcaagc atottcctgc taccaggctg gggaggccca ggcccacgac 1860 
cccgaggaga gggaacgcag ggagactgag gcgacccttc tttcccccgg ggcccggtcg 1920 
tgtggttcgg tgtctctett ctgttggacc cttacettga cccaggcgct gccggggcct 1980 
gggcccgggc tgcggcgcac ggcactcccg ggaggcagcg agactcgagt taggcccaac 2040 
gcggcgccac ggcgttccct ggccgggaat ggcccgtacc cgtgaggtgg gggtgggggg 2100 
cagaaaaggc ggagcgagcc cgagcgggga gggggagggc caggggcgga gggggccggc 2160 
actactgtgc tggcggactg gcgggactag ggctgcgtga gtctctgagc gcaggcgggc 2220 
ggcggccgcc cctcccccgg cggcggcagc ggcggcagcg gcggcagctc actcagcccg 22 80 
ctgcccgagc ggaaacgcca ctgaccgcac ggggattccc agtgccggcg ccaggggcac 2340 
gcgggacacg ccccctcccg, ccgcgccatt ggcctctccg cccaccgcce cacacttatt 2400 
ggccggtgcg ccgccaatca gcggaggctg ccggggccgc ctaaagaaga ggctgtgctt 2460 
tggggctccg gctcctcaga gagcctcggc taggcagggg atcgggactc cggcgggagg 2 620 
gcggcttggt gcgfctfcgcgg ggatgggcgg ccgcggcagg ccctccgagc gcggtggagc 25BQ 
cgctctgtga gacagccggg tacgagtcgc gacgctggaa ggggcaagcg ggtggtgggc 2640 
aggaatgcgg tccgccctgc agcaaccgga gggggaggga gaagggagcg gaaaagtctc 2700 
caccggacgc ggccatggct cggggggggg ggggcagcgg aggagcgctt ocggccgacg 2760 
tctcgtcgct gattggcttc ttctcctccc gccgtgtgtg aaaacacaaa cggcgegttt 2820 
tggttggegt aaggcgcctg tcagttaacg gcagccggag cgcgcagccg ccggcagcct 2 880 
cgctctgccc actgggtggg gcgggaggta ggtggggtga ggcgagctgg acgtgcgggc 2940 
gcggccggcc tctggcgggg ogggggaggg gagggagggt cagcgaaagt agctcgcgcg 3 000 
cgagcggccg cccaccctcc cctccctctg ggggagtcgt cttacccgcc gccggccggg 3060 
cctcgtcgtc tgattggctc tcggggccca gaaaactggc ccttgccatt ggcccgtgtt 3120 
cgtgcaagct gagtccatcc gccggccagc gggggcggcg aggaggcgct cccaggttcc 3180 
ggccctcccc tcggccccgc gccgcagagt ctggccgcgc gcccctgcgc aacgcggcag 3240 
gaagcgcgcg ctgggggcgg ggacgggcag tagggctgag cggctgcggg gcgggtgcaa 3300 
gcacgtttcc gacttgagct gcctcaagag gggcgtgctg agccagacct ccaccgcgca 3360 
ctccggggag tggagggaag gagcgagggc tcagttgggc tgttttggag gcaggaagca 3420 
cttgctctcc caaagtcgct ctgagttget atcagtaagg gagctgcagt ggagtaggcg 3 400 
gggagaaggc cgcacccttc cccggagggg ggaggggagt gttgcaatac cctcctggga 3540 
gttctctgct gccccctggc ttctgaggac cgcoctgggc ctgggagaat cccctceccc 3600 
tcttccctcg tgatctgcaa ctccagtctt tctagaagafc gggcgggagt cttctgggca 3660 
ggcttaaagg cfcaacctggt gtgtgggcgt tgtcctgcag gggaattgaa caggtgtaaa 3720 
atcggaggga caagacttcc cacagatttt cggctttgtc gggaagctfct ttaatagggg 3780 
aaaataagga aaatgggagg ataggtagcc atctggggtt ttatgcagca aaacbacagg 3840 
ttattattgc ctgtgatccg cctcggagta ttttccatcg aggtagatta aagacatgcc 3900 
cacccgagtt ttatactctc ctgcctgaga tccttactac agtatgaaat tacagtgtcg 3 960 
cgagttagac tatgtaagca gaattttaat catttttaaa gagcccagta cttcatatcc 4020 
atttctcccg ctccttctgc agccttatca aaaggtatct tagaacactc attttagccc 4080 
catcttcatt tattatactg gcttacccaa cccctagaca gagcattggc attttccctt 4140 
tcctgatctt agaagcccga tgactcatga aaccagacag actagctaca tacaccacaa 4200 
atcgaggctg tagctggggc ctcaacactg cagctctttt ataactcctt agtacacttt 4260 
ttgttgatcc tctgccttga tccttaattt tcagtgtcta tcacctctcc cgtcagtggt 4320 
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ggttgatggt aagcttataa ggaacttgaa agtggagtaa ctactccatt tctctgaggg 8040 
gagaattaaa atttttgacc aagtgttgtt gagccactga gaatggtctc agaacataac 8100 
tticttaagga accttcccag attgccctca acactgcacc acatttggtc ctgcttgaac 6160 
attgccatgg ctcttaaagt cttaactaag aatattaatt gtgtaattat tgtctctcct 8220 
cctttagatc actccttgag gacaggacag tgcttgttta aggctatatt tctgctgtct 8280 
gagcagcaac aggtcttcga gatcaacatg atgttcataa tcccaagatg ttgccattta 8340 
tgttctcaga agcaagcaga ggcatgatgg tcagtgacag taatgtcact gtgttaaatg 8400 
ttgccatgca gtttggattt ttctaatgta gtgtaggtag aacatatgtg ttctgtacga 8460 
attaaaotct taagttacac cttgtataat ccatgcaatg tgttatgcaa ccaccacttt 8520 
aagtattgta gctttctttg tatgtgagga taaaggcgtt tgtcataaaa tgttttgaac 0560 
atctccccaa agttccaaat tacaaaacca caacgttaga aettatttat gaacaatggt 8640 
tgcagtctca tgcttttaaa atgcttaatt attcaactaa caccgtttgt gttataatat 8700 
atataaaact gacatgnaga agtgtttgtc cagaacattt cttaaatgta tactgtcttc 87 60 
agagagctca atatagcatg tcttttgcaa catactaact tttgtgttgg tgcgagcaat 8820 
attgtgtagt cattttgaaa ggagtcattt caatgagtgt cagattgttt cgaatgttat 8880 
tgaacatttc aaatgcagac ttgttcgtgt tttagaaagc aaaaotgtca gaagctttga 8940 
actagaaatt aaaaagctga agtatttcag aagggaaata agctacttgc tgtattagtt 9000 
gaaggaaagc gtaatagctt agaaaattta aaaccatata gttgtcattg ctgaatatct 9060 
ggcagatgaa aagaaatact cagtggcccc tttgagcaat ataacagctt gttatattaa 9120 
aaattttccc cacagatata aactctaatc tataactcat aaatgtcaca aatggatgaa 9180 
gcttacaaat gcggcctgac ttgtcactgt gcttgcttta gttatgtgaa agtttggcaa 9240 
taaacctatg tcctaaatag tcaaaccgtg gaatgacttt ttaatctatt ggtttgtcta 9300 
gaacagtcac gctgccattt gccctaatgg tgaaagaaaa agtggggagt gccttggcac 9360 
tgttcatttg tggtgtgaac caaagagggg ggcatgcacc tacacttcaa acatccttct 9420 
gaaagactga caagcttggg tcttcacagt tggaattggg catccctttt gtcagggagg 94B0 
gagggaggga gggaggctgg cttgttatgc tgacaagtgt gattaaattc aaactctgag 9540 
gtaagttgga ggaacttgta cattgfctagg agtgtgacaa tttggactct taatgatfctg 9600 
gtcatacaaa atgaacctag accaact'tct ggaagatgta tataataact ceatgttaca 9660 
ttgacttcac ctgactaata cttatccctt atcaatcaaa tacagaagat gccagccatc 9720 
tgggcctttt aacecagaaa tttagcutca aactcctagg ttagtgttct cactgagcta 9780 
catcctgatc tagtcctgaa aataggacca ccatcacccc caaaaaaatc tcaaataaga 9840 
tttatgctag tgtttcaaaa ttttaggaat aggtaagatt agaaagtttt aaattttgag 9900 
aaatggcttc tctagaaaga -tgtacatagt gaacactgaa tggctcctaa agagcctaga 9960 
aaactggtac tgagcacaca ggactgagag gtctfctcfctg aaaagcatgt attgctttac 10020 
gtgggtcaca gaaggcaggc aggaagaace tgggctgaaa ctggtgtctt aagtggctaa 10080 
catctccaca actgatgagc aagaacttta tcctgatgca aaaaccatcc aaacaaacta 10140 
agtgaaaggt ggcaatggat cccaggctgc tctagaggag gacttgactt ctcaccccat 10200 
cacccacacc agatagctca tagactgcca attaacacca gcctctagcc tccacaggca 10260 
cctgcactgg tacacataat ttcacacaaa cacagtaaga agccttccac ctggcatggt 10320 
attgcttatc cttagttccc aacacttggg aggcagaggc cagccagggc tatgtgacaa 10380 
aaaccttgtc tagaggagaa acttcatagc ttatttccta ttcacgtaac caggttagca 10440 
aaatttacca gccagagatg aagctaacag tgtccacuat attfcgtagtg ttctaagtca 10500 
attttttaaa tatacttaac agaaccaaag ctacggtgaa ccaagtacaa acctggtgta 10560 
ccaacttgag aacctagcat aaaaagCagc tcatctgttc agcaaatatt aaatgcctac 10620 
tggcaaagat tatgtcagga acttggtaaa tggtgatgaa acaatcatag ttgtacatct 10680 
tggttctgtg atcaccttgg tttgaggcaa- aagtggctcc tttgatcaag gatggaattt 10740 
taagtctaca ttcaatcaac aatgtattat tttgtgattg caaaattgcc tacctagggt 10800 
ataaaacctt taaaaatttc ataataccag tccantctcc agtcactaat tccaaaaagc 10860 
cactgacT:at ggtgccaatg tggattctgt tctcaaagga aggattgtct gtgcccttta 10920 
ttctaataga aacatcacac tgaaaatcca agctgaaaga agccagactt tcccaaataa 10980 
acaactttcc ataaagctca aacaaggatt acttctagga ggcactgtta aggaactgat 11040 
aagtaatgag gttactcata taatgatagt cccacaagac tatccgagga aaaatcagta 11100 
caactcgaaa acagaacaac cagctaggca ggaataacag ggctcccaag ccaggaggtc 11160 
tatccaacac ccttttctgt hgagggcccc agacctacat accgtataca aacagggagg 11220 
tgggtgattt taactcccct gaggtacott ggtaaatctt tgtcctgagt aagcagtaca 11280 
gtgtacagtt tacattttca tttaaagata cattagcccc ctctaccccc taagactgac 11340 
aggcactttg ggggtgggga gggctttgga aaataacgct tccatacact aaaagagaaa 11400 
tttctttaat taggcttgtt ggttccatac atctactggt gtttctacta ctcagtaata X1460 
ttataatagfc cacacaagca tctttgctct gtttaggctg tatatttatt ttaaggcaga 11S20 
tgataaaacc gtagatctta agggacgctc ccgctcctga gatgatacaa agaatttiaga 11S80 
ccataaaaca gtaggttgca caagcaatag aatatggcct aaagtgttct gacactcaga 11640 



25353 &°5| SSSSS SSSSE 2=3*- s-~ 
55555 3SSS& SSSS 3S5S a '™ S2! 

caaaggectg agcagtgcgc tttlfaffffc SSteSS ^ ctt 9 aCct gaaggatgaa liseo 
cccagaccac atggaccttt cgcclcalat "aaaggccc ccgatgetca U940 

aaaaaacata agtcattagg ctaccagSt SSSSSSS. a f ataa 9S cc tggcacacac 12000 
taaatgctat gggtggtggg ttggaaaqtt caacc taaaa tcttcccact 12060 

Wtot 39 g ttcaglttl? ga? 9 *^ ££tKE SSZH 015 9 fcttt ^gag 12?20 
gtccatccaa caccctcttc tattoao^ f tataacecca gttctagggo I2ian 

aggtgagtga tttaactctc 5£2gS£ 22SfS? aaa ^"tgtgt acaaaclflf Hill 
gcagtacagc cctgccfcgt? SgSSK 22££ ^ tetCB ^9*9*™ SIS 
gahagagatt ccaatctctc ttctfggatc SitSSKS *™? CC9a a 9gcaagctg 123?? 
tattgcaaaa ggtaaatagg tagtttctta eatSSX* aaagaaggtc aagtttaatc 12420 
gcagtcatct tetacaggcc aglgceSta SSi?^** 9 aaca aatct taggtgtgaa 12480 
agtgttatag agttagllca gtgtatagt? ttctate^ 3 at9aa 99aaa «gecagtcc iltll 
tgcaactcag cttattgaag LLJS3K agtajaaat? a * actttttt tttaaacaac 126OO 
ctcagcctac aatgtggaat aggaccataf StSKSS ^gtccaagaa gcaagtgctt 12660 
atgtttttct gactgagaag allatae^ * a »? 9 ac&9 t 9agcgaaat gaattatggc 12720 
agaatceaat StetScKS ^ccaaaSg £££££ ^ cat ^ tat "atttaSa u™ 
"WP agatatScff SSSSEZ EE"** ccaca 9 aa 3= "840 
oX« ttCC9 ^agacaaa ggtaagaca? Sfaatata? tt*^" 90 * aacccc *9gt: 12900 
gctcagcaaa Caatggtact ggccattaao SteSSf!?!: * ttccca agc taatgagatg 12960 
SSST" 0 aa 99agagac SStSS cag^glaK SSSSS^ CCtcagggal 130 2 S 
ccacacacaa ataaatacaa taaaaaacat trtSSSS SSteSS accct *acgt: 13080 

ww^a^agccg aatttctaaa agggcgaat 13139 



<210> 4 

<21l> 1073 

<212> DMA 

<«213> Mus mus cuius 

<220> 

<223> Deacripcion: s« arm for Hosa26 

<400> 4 



EESSZ SSSSS SE322 SKSS c ? 99tac9a * 60 

III a f aa99Sr 5525£ HSSSS? ™^ a3CaaC °93 a 39sis* 120. 

9cggaggagc gcttccggcc gacgtctcg? JS2E£« ^h 99993 99933gggca 180 
tgtgaaaaca caaatggcgt attfct««^ 9 « u 9 fl 3 cttctettcc tcccgccotg 240 

l?a£l C3Ca 9Ccgc 9 SS locate? ^cISIIS 9 22S C * 9tt aa =9l«Ic? iSS 
£f a f 9C9ag ct 99acgtgc Sggcglggcc gg^teSS? ^f?ggcggga ggtaggtggg 360 
gg9tcagcga aagtagctcg cgcqcaaaca zSZZzSSEF' 9gggcggggg aggggagggl 4 20 
tegtttfeacc cgccgccggl JSSSf*? 0 CC f ccctt « tctgigglg Jeo 

tggcccttgc caccigctlg 9°t=tcgggg cceagaaaal 540 

ggegaggagg cgctcccagg ttccScIS? a t=cgccggc cagciggggc ?oo 

gcgcgcccct gegcaacgtg gca«?faSS ™ f" CC Cc g c gc=g<=a gagtctgicc 660 
tgagcggctg cglggcgfg? JSgSK ctc^ 99 ? 9 9c 993gacgg gcagtallgc 720 
getgagecag acctccatcg cgcactcefo ° CtS a 9 tc 9cctca agagggiclt 780 

gggctgeett ggaggcagg? lg«lt?gS 9 f? a ?nff a I? 9aa ?9 a gcga gggclclgft ill 
aagggagctg cagcggaj^a gfcgSSSSa a^™ 96 c ? c tctgagt tgttatcagt 900 
gagtgtcgca acac«c£ct liSIttJfc Jlfto?™ ^tetcegga ggggggagfg 9eo 
gggcctggga gaatcccttc SSc cl^a^ gSSgJg gaccgccct 1020 



<210> S 

<211> 4333 

<212> DNA 

«2X3> Mus musculus 
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<220> 

<223> Descriptions 3 r arm for Rosa26 



<400> 5 , 
tagaagatgg gcgggagtct fcctgggcagg 
tcctgcaggg gaattgaaca ggtgtaaaat 
gttttgtggg gaagttttfct aataggggca 
ctggggtttfc atgcagcaaa actacaggtt 
tnccatcgag gtagattaaa gacatgctca 
cttactacag tatgaaatta cagtgtcgcg 
ttttnaaaga gcccagtact tcacatccat 
aggtatttta gaacactcat tttagcccca 
cctagacaga gcattggcat tttcccttcc 
ccagacagat tagttacata caccacaaat 
gttcttfctat aactccttag cacacctctt 
agtgtctatc acctctcccg tcagtggtgn 
agcttcacaa caatagatgt attgagaatc 
aatgcctctc tcctttttcc ccatttataa 
tggatgtctc ctccccatat tacctgafcgt 
gacattaaaa ggtatatttc attattgagc 
tttgtcattg tacacatctg taaaaggtgg 
gttgtctttc ctgacctaag gtcttgtgag 
ctcttggact ggcttgactc aeggcattcfc 
gccaagcttc ttcaggacct ataattttgc 
ttaagcaaca aatgtatetg tgaagcttgg 
gctctataat aatactatcc aggggctgga 
gctcttccag aagtcctgag ttcaattccc 
taatgggatc tgatgccctc ttctggtgtg 
taaataaatc ctccttcttc ttcttttttt 
aatttactga agtaatgaaa fcactttgtgt 
tajCtctttaa gcactggaaa tgttaccaag 
gacagaggag ccataactgc agacttgtgg 
cttttctctt acactaagca ataaagaaat 
taaactgcta gctttactta acttttgtgc 
gcagccatta ctaaacatga aagcaagcaa 
cagagtttag cc.agccagtg gtggtgcttg 
ggcagagaca ggcagatctc tgagfcfctgag 
taggatagcc aggaafcacac acagaaaccc 
aaactataat tgaagcattc cctaatgagc 
ttctgatgag atttgggtat tattttttct 
tgtgggcttt ctttaaagcc tccttcctgc 
catggcttaa atggcatggc tttttgcctc 
atttceaggg tggggttggg aaatctttca 
taaaaaatgg gttatataat aaaccccata 
antaaatgag tccctcccct ataaaagagc 
tttttttaaa tgtggfcatct fccagaaccaa 
ttgcatcgct taatcagatt ttctagtttc 
aaagtcaaat taagaatttc tigacttttaa 
gatagctttt cctgaggcag ggectcacta 
gtagactaaa gttctgcctg cttfctgtctc 
ttttggaact tgcaggtcag acecttetat 
gctggtaaaa tgtgtttcaa gtgatgaaaa 
taaaaaaaaa agccaggcct gttagagcat 
cacaagagta gctacttggc aggctcctgg 
aaccaagaaa ctacagttaa ggttaccCgt 
ctaaaatatt ccaaacaata ttucattcaa 
tctccggtga ataggcagag ttggaaacta 
tgttttcaag tgafcagfctaa agcccafcgag 
cfcfcggtttat actcagaagc actfcctfctgg 
ggcctacctt aggetgatcg ttgttcaaat 
atgatttcat atatcaaggc aaaacatgtn 



cttaaaggct aacctggtgt gtgggcgttg 60 
tggagggaoa agacttccca cagattttcg 120 
aataaggaaa atgggaggat aggtagtcat 18 o 
attattgctt gfcgatccgcc tcggagtatt 240 
cccgagtttt atactctcct gcttgagatc 300 
agttagacta tgtaagcaga attttaatca 360 
Ctctcccgct ccttccgcag ccttatcaaa 420 
fctttcattta ttatactggc ttatccaacc 480 
ctgatcttag aagtctgatg actcatgaaa 540 
cgaggctgta gctggggcct caacactgca 600 
gfctgatcctt tgccttgatc cttaatttfcc 660 
tccaoacttg ggcctattct cagtccaggg 720 
caacctaaag cttaactttc cactcccatg 780 
actgagctat taaccattaa tggttccagg 940 
atcttacata ttgccaggct gatattttaa 900 
cacafcggtat tgattactgc ttzactaaaat 960 
fctccttttgg aatgcaaagt tcaggfcgttt 1020 
cttgtatttt ttctatttaa gcagtgcttt loao 
acacgttatt gctggtctaa atgtgattbt 1140 
ttgaottgta gccaaacaca agtaaaatga 1200 
tttttaggct gttgtgttgt gtgtgcttgt 1260 
gaggtggctc ggagttcaag agcacagact 1320 
agcaaccaca tggtggctca caaccatctg 1380 
tctgaagacc acaagtgcat tcacattaaa 1440 
ttttfcttaaa gagaatactg tctccagtag 1500 
ctgttccaat atggtagcca ataatcaaat 1560 
gaactaactt ttatttgaag tgtaactgfcg Iff 20 
gatacagaag accaatgcag actttaacgt 1680 
aaaaattgaa cttctagtau cctacttgtt 17*40 
ttcatctata caaagctgaa agctaagtct 1800 
tgataatttt ggatttcaaa aatgtagggc I860 
cctttatgcc tttaatccca gcactctgga 1920 
cccagcctgg tctacacatc aagttctatc 1980 
tgtfcggggag gggggctctg agatttcata 2040 
cactatggat gtggctaaat ccgtctacct 2100 
gtctccgccg tcggttgggt cttttgacac t 2160 
catgtggtct cttgtttgct actaacttcc 2220 
ctaagggcag ctgctgagac ctgcagcctg 2280 
aacactaaaa ttgtccttta atttttcttt 2340 
aaatagttat gaggagtgag gtggactaat 2400 
tattaaggcc ttttgtctta tacttaactt 2460 • 
gggtcttaga gttttagtat acagaaactg 2S20 
aaatccagag aatccaaatc cttcacagcc 2580 
tgttaatttg cttactgtga atataaaaat 2640 
tgtatctctg cctgatctgc aacaagacat 2700 
ccgaatacta aggttaaaat gtagtaatac 2760 
^ggggacaca ctaagggagc , tcgggtgata 2 820 
cttgaactat tatcaccgca acctactttt 288 0 
gcttaaggga tccctaggac tcgctgagca 2940 
tgagagcata tttcaaaaaa caaggcagac 3 000 
ccttaaacca tctgcatata cacagggata 3060 
gctttccccc atcaaattgg gacatggatt 3120 
aacaaatgtt ggttttgtga tttgtgaaat 3180 
atacagaaca aagctgctac ttcgaggtct 3240 
grtcccccgc actatcctga tcatgcgcta 3300 
aaacttaagt ttcctgtcag gtgatgtcat 3360 
atacacgtta aacatctgta cttaatgtga 3420 
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£3535 S5&3S 25555 55555 2—- «*agtca ttc 3 4ao 

etagtaaccc tacctataga Pactttg5 gg5l2a5 J?? a " Caa9 ac ^tctgac 3540 
cctcgaagtt gagacgtttt Ittagtgtat SS555? HZ?? 3 * 9 ** aa 9 ea "tca 3600 
ccagaagata ttcaggaeea CttcSgactg 55155= a * 9tt TO B SS acaegtttat 3660 
tagtgagcat tgagtggcct ttaggcttqa afcSSfK* fct 9attetga tCagcactgt 3720 
ccggccttct ttaaaaagcc ettittotte 55555 f^ 9 * atatc tcaaataatg 3780 
aaagaaatac ttgtttggat ctcctttt5 !52 ? gt tttcta <=ata atttttgttc 3840 
ctttcctttt ctStttSet t5gg5tf a S5S5E SP?*?"* 9Ccata? C tt llol 
tgtcctggaa ctcactttgt agaccaggct 5555!? " fcctct 9 ta eagccctggc 3960 
gcctcctgag tgccgggatt aaaggcgtgc 25525S 5 agaaat : CC 9Cctgcctct 4020 
gteatataac cataaccaat actilctcca 5S55I? f^getaagt tggatatttt 4060 
ttaagcggtc tttattggcc ettcattaaa 555S5 =*«aat:tca gtcagtagtc 4140 
geactagtgg caettaagca ac5555 55555 « C «^ aaca Wtgteg 4200 
gaaactagtc tagcgttttg tacacccacl ?5555l * aga " aa 99 gtcagggata 4260 
atttcaggac ate c a 9<=tttatac caccttgttc tgatagaaat 4320 



<210> 6 
<211> 6039 
<212> DNA 

<213> Artificial Sequence 
<220> 

<400> 6 



£3555 5S555 55555 55525 I5 atta9tt S o 

acccccgccc attgaegtea ataatgaelt 55eS5£ fcctggctga ccgcccaacg 120 
tccattgacg tcaatgggtg gaceaf5Sc 55552 ^aacgeca atagggactt 180 
tgtatcatat gecaagtaci ccc555S 25552? ° ca « et W« gtacateaag 240 
aetatgecca gtacatgacc ttatgggac! 5S55X£ C " taaac 99 cccgcctggo 300 
teategctat taccaegggt cgagg?55 55553 9cagtacatc tacgtattag 3 60 
cccccctccc cacccccaat tttJKSt? 55??5?!r gcttcactct ccccatctcc 420 
||9 3 cggggg ggggggggg C gcggSagg 55555 552"*** ^cagegatg 4 80 
ggegaggegg agaggtgegg eggcageeaa tiff?? 9999 c 9999<=gagg ggeggggegg 540 
catggegagg cggclglgfl gJSSSSS 5S535 ? C9 f tcC9aa agtttcctt? 600 
cgctgcgttg ccttcgcccc gtgSScgc* 55555 r?? 909 *" 0 999Cgggagt 660 
ctgactgacc gcgttactcc cacag55a 5S5Sf« ^cgcgccgcc cgccccggct 720 
taattagege ttggtttaat gacgg55? 3lt3IJ5 ?gcccttctc cteegggctg 780 
agggctcegg gagggecett fgtffggglg 55555 2 9 5 9 ? 9t3 aaa acctcal 8*0 
tgtgcgtggg gagcgccgcg tgcggccS? 22255? 9999ggtgcg tgcgtgcgtg 900 
eggegegggg ctctgtgcgc tfcl555 f?2?? "° 9 9 ct 9 c 9agc getgegggei 960 
ccgcggtgcg gggglgltj? 52S25 E25SZS 9 ° 9C " CC 99 gggcggtfcc 1S20 
99tgagcagg gggtgtgg|c £clf2g5g 5f?55f? ? c ""5 9t 9 cgcgcggggg 1080 
cgagttgctg agcacgiccc iglt515? 2555?° cccccct 9c* cccccctccc 1140 
ctcgccgcgc cggg Cggggg SSggcSSS 522?5? 9 t 9cggggcg eggegegggg 1200 
egggcegggg agggcccJii ggagggf5I 52222? c 999cggggc ggggccgcct 1260 
gegeggegag ccgcagccal 32552!! 2?II? ° 9 9agcgccggc ggctgecgag 1320 
etttgeceea aaLtfgcgg af5g55e SSSSSS ° 9a9a " 9C9 cagggacltl 138? 
gegegggega ageggtgegg cgcclgca5 afffaff^ c 9 cc 9 cacc ° cctctagcgg 1440 
tcgccgcgec gccgteccll tS5ISS?f 5l?^ aat " 9c 99ggaggg ccttcgtgc? 1500 
geettegggg gggfcgggg^ agggljggg? SS5SS £552*°* 9gggacggc? J5 
gaagcgttgg ggtgagtaet elStctllfa aS25J5 9 9 <=9t9tgaec ggeggctcta 1620 
agtttccaaa aacgalgagg atttoatafcf a ?°9ggcacg acttctgcgc taagattgtc 1680 
99ecgcgtce atctg|?2I 55X55 S55SST ? c " t9at 9«= ctttgag|gt 1740 
gectgagate CggcSatacI c5glg55 55515? Z***9 a **9* ggtgtggcag iaoo 
ggtgtccact cccagggcgg cctlcggSc 52525 a «*ttgcctt tctetccaca I860 
cgtegaccat gtccaattta ctgaeSgSI S5KSS cc 9^taggc tagccggccg i S20 

s 3cac a ccaaaactt gectgeatta ccggtcgatg i960 



EmPfansszeit 8. Jan. 12:35 
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caacgagtga tgaggttcge aagaacctga 
ctgagcatac ctggaaaatg cttctgfcccg 
tgaataaccg gaaatggttt cccgcagaac 
ttcaggcgcg cggtctggc^ gtaaaaacta 
ttcatcgtcg gtccgggctg ccacgaccaa 
ggcggatccg aaaagaaaac gctgatgccg 
aacgcactga tttcgaccag getcgttcac 
tacgtaatct ggcatttctg gggattgctt 
ccaggatcag ggttaaagat atctcacgta 
gcagaacgaa aacgetggtt agcaccgdag 
ctaaaccggt cgagcgatgg atctccgtct 
tgttttgccg ggtcagaaaa aatggtgttg 
ctcgcgcccn ggaagggatt tttgaagcaa 
actctggtca gagatacctg gcctggtctg 
atatggcccg cgceggagtt tcaataccgg 
taaatattgt catgaactat atccgtaacc 
tgcfcggaaga tggcgattct gctggagaca 
tcatgatcaa acgctctaag aagaacagcc 
tcagtgcctt gttggatgct gagcccccca 
ecttcagega agcttcgatg atgggctnac 
acacgatcaa ctgggcgaag agggtgccag 
tccaccttct agaatgtgcc tggctagaga 
tggagcaccc agtgaagcta ctgtttgctc 
aatgtgtaga gggcatggtg gagatcttcg 
gcatgatgaa tctgcaggga gaggagtttg 
ctggagtgta cacatttccg cccagcaccc 
accgagtcct ggacaagatc acagacacct 
ccctgcagca gcagcaccag cggctggccc 
acatgagtaa caaaggcatg gagcatctgt 
tctatgacct gctgctggag gcggccgacg 
gaggggcatc cgtggaggag acggaccaaa 
cgcattcctt gcaaaagtat tacatcacgg 
gagcggccga ccggttcgag atccaggcgc 
atctgtgtgt tggctttttg tgtgccttgg 
aagggggagg gggaggccag aaegaccttg 
gagggggagg ccagaatgag gcgcgccggt 
ggaacttcgg aataggaact tctfcaggtca 
aaggcagtct: ggagcatgcg ctttagcagc 
cctctggcct cgcacaeatt ccacatccac 
tggccccttc gcgccacctt ctactcctcc 
gctcgcgccg tgcaggaegt gacaaatgga 
ggacagcacc gctgagcaat ggaagcgggt 
ttgctccttc gctttctggg ctcagaggct 
gggcgggctc aggggcgggg cgggcgcccg 
gctbcaaaag cgcacgtctg ccgcgctgtx 
ctgcagccaa tatgggatcg gccattgaac 
cttgggtgga gaggctattc ggctatgacc. 
ccgccgtgtt ccggcegtca gcgcaggggc 
acggtgccct gaatgaactg caggacgagg 
gcgttccttg cgcagctgtg ctcgacgttg 
tgggcgaagt gccggggcag gatctcctgt 
ccatcaeggc tgatgcaatg cggcggctgc 
accaccaagc gaaacatcgc atcgagcgag 
atcaggatga tctggacgaa gagcarcagg 
tcaaggcgcg catgcccgac ggcgaggatc 
cgaatatcat ggtggaaaat ggccgctttt 
tggcggaccg ctatcaggac atagcgttgg 
gcgaargggc tgaccgcttc ctcgtgcttt 
tcgcccccta tcgccttctt gacgagtcct 
agaaattgat gatctattaa acaataaaga 
tactttgtta agaagggtga gaacagagta 
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tggacatgtt eagggatcgc caggcgtttt 2 040 
tfctgccggtc gtgggcggca tggtgcaagt 2100 
ctgaagatgt tcgcgattai: cttctatatc 2160 
tccagcaaca tttgggccag ctaaacatgc 2220 
gtgacagcaa tgctgtttca ctggttatgc 2280 
gtgaacgtgc aaaacaggct ctagcgttcg 2340 
tcatggaaaa tagcgatcgc tgccaggaca 2400 
ataacaccct gttacgtata gccgaaattg 2460 
ctgacggtgg gagaatgfcta atccatattg 2520 
gtgtagagaa ggcacttagc ctgggggtaa 2580 
ctggtgtagc tgacgatccg aataactacc 2640 
ccgcgccatc tgccaccagc cagctaccaa 2700 
ctcatcgact gatttacggc gctaaggatg 2760 
gacacagtgc ccgtgtcgga gccgcgcgag 2820 
agatcatgca agctggtggc tggaccaatg 28B0 
tggatagtga aacaggggca atggtgcgcc 2940 
tgagagctgc caacctttgg ccaagcccgc 3 000 
tggccttgto cctgacggcc gaccagatgg 3060 
tactctattc cgagtatgat cctaccagac 3120 
cgaccaaccc ggcagacagg gagctggttc. 3180 
gctttgtgga tttgaccctc catgaecagg 3240 
tcctgatgat tggtcfccgtc tggcgctcca 3300 
ctaacttgct crtggacagg aaccagggaa 3360 
acatgctgct ggctacatca tctcggttcc 3420 
cgtgccccaa atctattatt ttgcttaatt 3480 
tgaagtctct ggaagagaag gaccatatcc 3540 
tgacccacct gatggccaag gcaggcctga 3600 
agctcctcct catcctctcc cacatcaggc 3660 
acagcatgaa" gtgcaagaac gtggtgccce 3720 
cccaccgcct acatgcgccc actagccgtg 3780 
gccacttggc cactgcgggc tctacttcat 3B40 
gggaggcaga gggfcttccct gccacagtct 3900 
ggatcaataa aagatcatta ttttcaatag 3960 
gggaggggga "ggccagaatg aggcgcggcc 4020 
ggggaggggg aggccagaat gaccttgggg 4080 
aaccgaagtt cctacacccc ccagagaata 4 14 0 
attctaccgg gtaggggagg cgcttfctccc 4200 
cccgctgggc acctggcgct acacaagtgg 4260 
cggtaggcgc caaccggctc cgttcttcgg 4320 
cctagtcagg aagttccccc ccgccccgca 4380 
agtagcacgt ctcactagtc tcgtgcagat 444 0 
aggccttcgg ggcagcggcc aatagcagct 4500 
gggaaggggt gggtccgggg gcgggctcag 4560 
aaggtcctcc ggaggcccgg cattctgcac 462 0 
ctcctctccc tcatctccgg gcctttcgac 4680 
aagatggatt gcacgcaggn tctccggccg 474 0 
gggcacaaca gacaaccggc tgctccgatg 4800 
gcccggttct ttttgtcaag accgacccgt 4 860 
cagcgcggct aticgtrggctg gccacgacgg 4920 
tcactgaagc gggaagggac tggctgctat 4980 
catctcacct tgcccccgcc gagaaagtat 5040 
atacgcttga tccggctacc tgcccattcg 5100 
cacgtactcg gatggaagcc ggtcttgtcg 5160 
ggcccgcgcc agccgaactg ttcgccaggc 5220 
tcgtcgtgac ccatggcgac gcccgcttgc 5280 
ctggattcat cgactgtggc cggctgggtg 5340 
ctacccgtga tattgctgaa gagcctggcg 5400 
acggtatcgc cgctcccgat tcgcagcgca 5460 
cctgagggga tcgatccgct gtaagtctgo 5520 
tgtccactaa aatggaagtc tfctcctgtca 55 80 
cctacattcc gaatggaagg attggagcta 5 640 



cgggggtggg ggtggggtgg gattagataa atgectqcte ttta P »- naaw „ - «... 
. ategcettat gataacgCtt catagttgga tafcaclatt SSSS^l 9 9ctctttac t S700 
aagggccagc tcattcctcc cactlatfat cta?aoJ?^ ? Ca aaaccaaa <* 5760 

ttgtttttee cttgattccc actttgtgX cetaStaet !2£2 tCt C9t93 3 at « SB2 ° 
tttcatagcc tgaagaacga ' gateagclgl "c?gttcla SSS^ aatgtgtcag S880 
ttgttttgcc aagttctaac tccatlagaa SSSetS. a ttctcagta 5940 

atactttcta gagaatagga acctcgglat IggXctSc 9atCCC9C9c *9 aa 9tt:cc t 6000 



<210> 7 
<2H> 14411 • 
<212> DMA 

«213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence, taraetino 

vector for Roea26 locua with^ C^3Sftt«rt 
<400> 7 

SSSSS SSSSSi lltttttlll SET** a ^cagcga gtcagtgagc 60 
taafgcag?? gg!algl"g gtttcccgtc SSSSS SK?" 9 120 
aatgtgagtt agctcactca tcaoacac« ^ff^f?? 99 9ca 9 fc 9agcg caacgcaatt 130 
atgttgtgtg gtattgtgag cgtlflacaa SSSeaSf a «ttatgct tccggctcgt 240 
tacgccaagc gcgcaattal ecft^^f "* c * oaca 9 gaaacagcta tgaecatgat 300 
cgafggccat fgag^acgc tagggacaat SSSSS alSf 6989 atcta9a ^ >«• 
Scgcggtgga gccgtcctgt gailllgccS SKSSJ a ? CC9C " ca "ecctccga 420 
cgggtggtgg gcaggaatgc ggf?cgl«? %££££ SET'S" aa S3ggcaag 48 0 
cggaaaagtc tccacoggac gcggclataa SmIS^^S fagggggagg 9agaagggag 540 
ttccggccga cgtctcglcg I?gStSgf? Stt^ 9 ?! ^ g ^ca.gc ggaggagcgc 600 
aatggcgtgt tttggttggl gtaaeSacc E2£2^ cc 3=cgtgtg tgaaaacaca 660 
cgccggcagc ctciltcEgc ceacIISatS cggcagccgg agtgcgcagc 720- 

ggacgtgcgg gcgfggtcIS ccrStffno 9 - 999C999a 99 taggtggggt gaggcgagct 780 
gtagltlgll fglgllcggl cgccScc?? £b£S3£ f^agggagg gtcaglglaa 840 
ccgccggccg ggectcatlo cltaa*^^ ^cccttcccc tgggggagtc gttttacccg 9Q 0 
ttggctlgtl S£££3 tSSSteS 12252SZ ca 9 aaaact 9 gcccttgccl 960 
ctcccaggtt ccggccc**! cct^gccSc gcIcS?aga SSSSS 9 C9a " a 9 9 *9 1020 
gcaacgtggc aggaagcgco oactoaoaar 9 9tctggccgc gcgcccetgc 1080 

gggcgggtgc aalca?g?£? cIga????K S5S3SS a * Ca 9"ccg agcggctgcg 1140 
ctccatcgcg cactcciggg agtg^Kf aalaScaaoI H9 ? 9 ? C9tgC ^S^ac 1200 
aggcaggaag cacttgltlt SSSotS tte£2E£ fSS*?^ 93 9°Cgctctgg 1260 
gcggagtagg cggggigaag gccgcaccc? tet™^ 9 ttatcagtaa gggagctgca 1320 
acctttctgg gaplctctg I?Sctcc?a SaSSSE 9999a 99gga gegcegcaat 1380 
atcccttccc ccccttccc? cgtgatrtSl factSfo? 9 a ^9ccctgg gcctggg aga 1440 
ctgcaggttt tcgacattca tfcafc 8 -^?^ ^ 9tC tttc Caggea accgataccc IS 00 
tagttlltag ccla?ata?| gagtclcgcj ftacaSact 22S? W *f a *9999tcat 1560 
gctgaccgcc caacgacccc wcccatt« tac 93taaat ggcccgcctg 1620 

cgccaatagg gactltccat SacSc2t 2S?2S»^ C ? a * 9Cat9 " cccatagtaa 1680 
tggcagtaca tcaagtgtat cffaf^^ 99 ?^ 99acta tttac ggtaa actgcccact 1740 
aatggcccgc c^fatSt SSSSSS. ZSSSSSZ taCt ? ac 9^ aatgacggta "00 
acatctacit atSgecaS StStaoci ta«S 9 fgactttccc acttggcagt i8 60 
actctcccca tctccccccS 2?SScSS5 ccfft^ 9 t9a9C ^ CaC 9tcccgcrtc 1920 
ctttgcgcag cgatgaaaac aaaof^f ccaatttcgt atttatttat tctttaabta 1980 
cgagfggcg^ 3S555 SSSSS fll?^ 9 «^! 9 ^ 999 C 99 9 9 c 9999 2040 
ccgaaagctt ccctltltfg IlallcfgE gccaatcaga gcggcgcgct 2100 

gcggcgggcg ggagtcgc?g SS52S2 *£££££ £££££ a 9 c 9aagcgc 2X60 
ccgcccgccc cggccctga? cgacLcgtt IctccSeaa ^-f^ C9C9 cc 9 cctc g«9 2 220 
ttctcctccg ggctgtaatt agcgctSSt ftSnff^ 9 9 5 9a ff 93 9 c 9Egacggccc 2280 

ssss SS33K S ssHF asss ssss ss 

t 9 . s = 9 « 9 c -j-jg- ™^ -jj-jjjj. jjj. 



gccgggggcg gtgccccgcg gtgcgggggg 
gtgtgtgcgt gggggggtga gcagggggtg 
ccgcaccccc ctccccgagt fcgctgagcac 
gggcgtggcg cggggctcgc cgtgccgggc 
ggggcggggc cgcctcgggc cggggagggc 
ccggcggctg tcgaggcgcg gcgagccgca 
gggcgcaggg acttcctttg tccgaaatct 
caccccctct agcgggcgcg ggcgaagcgg 
gagggccttc gtgcgtcgcc gcgccgccgt 
cgcaggggga cggctgcctt cgggggggac 
tgaccggcgg ctctagaagc gttggggtga 
tgcgctaaga ttgtcagttt ccaaaaacga 
gatgcctttg agggtggccg cgtccatctg 
cctgaggtgt ggcaggctfcg agatccggcc 
gccttcctct ccacaggtgt ccactcccag 
taggctagcc ggccgcgtcg accatgtcca 
cattaccggt cgatgcaacg agtgatgagg 
atcgccaggc gttttctgag catacctgga 
cggcatggtg caagttgaat aaccggaaat 
attajtcttc t . atatcttcag gcgcgcggtc. 
gccagctaaa catgcttcat cgtcggtccg 
tttcactggt tatgcggcgg atccgaaaag 
aggctctagc gttcgaacgc actgafcttcg 
atcgctgcca ggatafcacgt aatctggcat 
gtatagccga aattgccagg atcagggtta 
tgttaatcca tattggcaga acgaaaacgc 
ttagcctggg ggtaactaaa ctggtcgagc 
atccgaataa ctacctgttt tgccgggtca 
ccagccagct atcaactcgc gccccggaag 
acggcgctaa ggatgactct ggtcagagat 
tcggagccgc gcgagatatg gcccgcgccg 
gtggctggac caatgtaaat attgtcatga 
gggcaatggt gcgcctgctg gaagatggcg 
tttggccaag cccgctcacg atcaaacgct 
cggccgacca gatggtcagt ggcttgttgg 
atgafccctac cagacccttc agtgaagctt 
acagggagct ggttcaeatg atcaactggg 
ccccccatga tcaggtccac cttctagaat 
tcgtctggcg ctccacggag cacccagtga 
acaggaacca gggaaaatgt gtagagggca 
catcatctcg gttccgcatg atgaatctgc 
tcattttgct taattctgga gtgtacacat 
agaaggacca tatccaccga gtcctggaca 
ceaaggcagg cctgaccctg cagcagcagc 
tctcccacat caggcacatg agtaacaaag 
agaacgtggt gcccctctat gacctgcegc 
cgcccactag ccgtggaggg gcatccgtgg 
cgggctctac ttcatcgcat tccttgcaaa 
tccctgccac agtctgagcg gccgaccggt 
cattatttcc aatagatctg tgtgttggtt 
gaatgaggcg cggccaaggg ggagggggag 
agaatgacct tgggggaggg ggaggccaga 
actttctaga gaataggaac ttcggaatag 
ggaggcgctt ttcccaaggc agtctggagc 
gcgctacaca agtggcctct ggcctcgcac 
ggctccgttc tttggtggcc ccttcgcgcc 
cccccccgcc ecgcagctcg cgtcgtgcag 
tagtctcgcg cagatggaca gcaccgctga 
cggccaatag cagctttgct ccttcgcttt 
cgggggcggg ctcaggggcg ggctcagggg 
qccggcattc tgcacgcttc aaaagcgcac 
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gctgcgaggg gaacaaaggc tgcgtgcggg 2560 
tgggcgcggc ggfccgggctg taaccccccc 2640 
ggcccggctt cgggtgcggg gctccgtgcg 2700 
SSggggtggc ggcaggtggg ggtgccgggc 2760 
tcgggggagg ggcgcggcgg ccccggagcg 2820 
gccattgcct tttatggtaa tcgtgcgaga 2880 
ggcggaggcg aaatctggga ggcgccgccg 2940 
tgcggcgccg gcaggaagga aatgggcggg 3.0 oo 
ccccttctcc atctccagcc tcggggctgc 3060 
ggggcagggc ggggttcggc ttctggcgtg 3120 
gtactccctc tcaaaagcgg gcatgacttc 3180 
ggaggatttg atattcacct ggcccgcggt 3240 
gtcagaaaag acaatctttt tgctgtcaag 3300 
atacacttga gtgacattga catccacttt 3360 
ggcggcotcc ggagcgatcg ccggtccgcc 3420 
atttactgac cgtacaccaa aatttgcctg 3460 
ttcgcaagaa cctganggac acgtrcaggg 3540 
aaatgctccc gtccgtttgc cggtcgtggg 3600 
ggttccccgc agaacctgaa gatgtccgcg 3660 
^gg.cagtaaa aactatccag caaqatttgg 3720 
ggctgccacg accaagtgac agcaatgccg 3780 
aaaacgttga tgccggtgaa cgtgcaaaac 3840 
accaggttcg ttcactcatg gaaaatagcg 3900 
ttctggggat tgcttataac accctgccac 3960 
aagatatctc acgtactgac ggtgggagaa 4020 
tggttagcac cgcaggtgta gagaaggcac 4080 
gatggatfctc . cgtctctggt gtagctgatg 414Q 
gaaaaaatgg tgttgccgcg ccatctgcca 4200 
ggatttttga agcaaetcat cgattgattt 4260 
acctggcctg gtctggacac agtgcccgcg 4320 
gagtttcaat accggagatc atgcaagctg 4380 
actatatccg taacctggat agtgaaacag '4440 
attctgctgg agacatgaga gctgcqaacc 45 oo 
ctaagaagaa cagcctggcc ttgtccctga 4560 
atgctgagcc occcatactc tattccgagt 4620 
cgatgatggg cttactgacc aacctggcag 4680 
cgaagagggt gccaggcttt gcggatttga 4740 
gtgcccggct agagatcctg atgattggtc 4800 
agctactgtc tgctcctaac ttgctcttgg 4860 
tggtggagat cttcgacatg ctgctggcta 4920 
agggagagga gtctgtgtgq ctcaaatcta 4980 
ttctgtccag caccctgaag tctctggaag 5040 
agatcacaga caccttgacc cacotgatgg 5100 
accagcggct ggcccagctc ctcctcatcc 5160 
gcatggagca tccgtacagc atgaagtgca 5220 
tggaggcggc cgacgcccac cgcctacatg 5280 
aggagacgga ccaaagccac ttggccactg 5340 
agcattacat cacgggggag gcagagggcr 54 00 
tcgagatcca ggcgcggatc aataaaagat 5460 
ttrtgcgtgc cctgggggag ggggaggcca 5520 
gccagaatga ccttggggga gggggaggcc 5580 
atgaggcgcg ccggtaaccg aagttcccat 5640 
gaacttctta ggtcaattcc accgggtagg 5700 
atgcgcttta gcagccccgc tgggcactcg 5760 
acattccaca tceaccggta ggcgccaacc 5820 
accttctact cctcccctag tcaggaagtt 58 80 
gacgtgacaa atggaagtag cacgtctcac 5940 
gcaatggaag cgggtaggcc tttggggcag 6000 
ctgggcccag aggctgggaa ggggtgggtc 6060 
cggggcgggc gcccgaaggt cctccggagg 6120 
gtccgccgcg ctgttctcct cttcctcatc 6180 
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cccgggcctt tcgacctgca qccaataton « ^ 

caggfctctee ggccgcttgg gtggagagf? 221122 ^ aacaa 9rat ggattgcacg 6240 
tcggccgctc tgatgccgcc gtottScSS 22222 2 aCt999Ca caa * a 9acaf «oi 
tcaagaccga cctgtccggt gccctgaitg 22222 ""^cccg 9"ctttttg 6360 
• ggctggccac gacgggcgtt ScttgScS 22221! c 9 a 99cagcg cggctaccgE 6420 
gggaccggct gctattgggc gaagtgfcaf 22f2! 9a c 9 ct 9teact gaagegggaa 6480 
ctgccgagaa agtatccalc 2gfc?2?g S2SSS2 ^? tCatCt cac ^tf?tc 22 
ctacctgccc atecgaccac caagegaaac 2!!?f? 9 9 9Ctgcatacg cttgatccgg 66oo 
aagccggtct tgtcgatcag g22t22 2a!2 tC9a 9cgagcacgt actSggatS 666? 
• aaccgttcgc caggcccaai g2!E2c£ 21! 9S9Ca tca 9999ct:c gcgcSgcff 22 
gcgatgcccg cttjccgaa? SSS" 8 * 9gatctcgtc Sfc£oa£2 S2 

gtggccggct-gggtgtggcg g2cgct2f 222?! C9 cttt "tgga fetcatcgae? 22 
ecgaagagct tggcggcgal cgqa222 !2?! aCagc 9ttggctacc cgtgatatto 22 
ccgattcgca gcgcltclcc 22222 2222 9t 9*tttacgg t atcg2g2l £2 

22222 ^ 9CaSaaa "9a a g2?t 22222 2222 9& ^iatlgat 22 
aagttttccc tgecafcactt tgttaagaaa 0,222! aaagaegecc aataaaafcgg 70 00 

12222? a r tac " 99 9?9gggg a gl £££t3 2222" actcc 9aag 522 
tgaaggetct ttactattgc tttatgataa tot??;?2 9 a taaatgcc tgctctttac 72 00 
aagcaaaacc aaattaaggg ccagc?22 222! tag tfc 99atatca taatctaaac 7260 
tctctcgtgg gatcatttgtt 22222 22222 at 9 at **ata gatctataga 73S 
ttccaaatgh gtcagtttca tagcetoafa 22! ac £ Ct 9tggttccaa gtactgtgle 7300 
acttcancet cagtittgt? 22212? 223!** 9cagcctctg ItccaXfJc ?J2 
cgcgocgaag ttcctataet ttS22f2 ctaattccat cagaagctga etctagatcc 7snn 
aagcgccaga agatgggcgj 22222 22222 ^taggta ct?c2 & c?t 756? 
gcgttgtcct gcagggiaat 5S222 ? 9 f 9 ?f tta a a9gctaacc tggcgtgcgg 7s2 
ttttcggttt tgtcggaag 22222 2 aaatt9ga gggacaagac ttlccalS? 22 

ESSES? 9gct??a ^ 52222 22222 222 at99 ??X 

agtattttcc atcgaggcag attaaaaaea 222! ttgcttgtga tccgccccgg 7000 
gagatcceta ctaeagEatg a2222J Jf" Caccc 9 agttfctatac tctcctgcft ilea 
taatcacttt taaaglgccc a222c2 f2 9 ? asitc a 9 a =tatgta agcagalttc 22 
atcaaaaggt attttagaac 22222 2222 Ct ccc 9<=*«tt cfgcSJStt 22 
ccaaccccta gacagaicat 22222 ?22 att " cac "a«at actggl22 22 
atgaaaccag acagltfag? 22X22 2222" tctta ^9t c C gf?g22 Hit 
actgeagcce ttttataact cc?tag22 22222 9 f 9ta S c t9 ggicctcSS 5JJ 
attetcagtg tctatcacct eccccgtc2 222?2 9 at ccCfctg cc ttgatcctta 8220 
ccagggagtt ctacaacaat ag2g?2S 2£?2 C °* oattt 99gcc taStctcagfe 0280 
cccatgaatg cccctctect tltt?t2at 2222!° ctaaa 9«ta acttt2act a!2 
22f! t99a c 9tctccecc cc222cc 22222 a3Ctat «= a ac oattaacSt llil 

22222 222? 9Ca ta t tt ca2a 3SS522 2222 9 ? ^2 

caaaatctcg toattgtaca catctgtaaa a ««SlZ a tsgtattgafc cactgcttac 0520 
gtgtttgttg tcttttcctga cot2I22 a9 9tS3ttcc tcttggaacg caaagt22 ala! 

222 ctct ^ctggS SSHSJ c2f222 222 ttc? at «Kg2I 

12222! a 9 c "cttca gfacctatal 22gc22 222 9Ct9 9 c «aaatgb 8700 
aaatgactaa gcaacaaatg tatttatoaa «!^S r5? a ctt 9tagcca aaeacaagta B7so 
gcttgtgctc tataacaacf 22222 KS*?** ta 9Stbgttg tgttgtgtgt 22 
cagactgctc teccagaagt eate2tS f2S a9a " t 9gctcggag ttaaSSSa 22 
catctgtaat gggatlfeglt g2?222 22222 accacat 9S* 99Ctc!cI2 22 
actaaataaa caaatcctcc tbctt22t t?2222 aa 9 acca c aa gcgcattcac 22 
cagtagaatt tactgaagta aco22!2 22 ttttfc ttta aagaga atlctgtctc 22 
tcaaattact ctttlagc2 2«222 22 9CCtgt tooaa tatgg hag222a 22 

2a2?22 ? a99a 9«at 225212 22222 22 cttca? e AS5fi £*o 

caatgtcttt tctcttacae haa rt « af »r« uw 3fcaggaca cagaagacca atocaaa^i- *->a « 

222 taaa ct9cta 92? ^2222? 22222 ?2 9aacttc 22 

aagtctgcag ccactactaa acawaaaoe !2h I tc tatacaaa gctgaaagct 9360 
tagggccaga gtttagccag 2ag?522 22222 aa " ct 99at ttc22!2 12? 
tetggaggca gagacaggca gatfcltglg 22222 tat 9°cttta atcccagcal 22 
222 Ca " ata 9=caiga Ita22laf 22222 9ccc 99tcta cacatcLgt 22 

22222 ta £ aatt 3** 9cat222 22222 I?^!? 9 ? 99 9 ctc tgag! c ^tl 
|S22I2 IJ2222 L S S222 2f£i2 fW^ SSSSS KS 

acttcccatg gcttaaatgg ca t g 9 gc Ct 2 ^21221 g~? t ^222 22 
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agcctgattt ccagggtggg gttgggaaat ctttcaaaca ctaaaattgt cctttaatfct 9900 
tttttttaaa aaatgggtta tataataaac ctcataaaat agttatgagg agtgaggtgg 9960 
actaatatta aatgagtccc tcccctataa aagagctatt aaggcttttt gtcttatact 10020 
taactttttt fcttaaatgtg gtatctttag aaccaagggt cttagagttt tagcatacag 10080 
aaactgttgc atcgcttaat cagattttct agttfccaaat ccagagaato caaattcttc 10X40 
acagccaaag tcaaattaag aatttctgac ttttaatgct aatttgctta ctgtgaatat 10200 
aaaaatgata gcttttcctg aggcagggtc tcactatgta tctctgcctg atctgcaaca 10260 
agatatgtag actaaagttc tgcctgcttt tgtctcctga atactaaggfc taaaatgtag 10320 
taatactttc ggaacttgca ggtcagattc ttttataggg gacacactaa gggagcttgg 10380 
gtgatagttg gtaaaatgtg tttcaagtga tgaaaacttg aattattatc accgcaacct 10440 
actttttaaa aaaaaaagcc aggcctgtta gagc'atgctt aagggatccc taggaccfcgc 10500 
tgagcacaca agagtagtca cttggcaggc tcctggtgag agcatatcta aaaaaacaag 1Q560 
gcagacaacc aagaaactac agttaaggut: acctgtcttt aaaccatctg catatacaca 10620 
gggatattaa aatattccaa ataatatttc attcaagttt tcccccatca aattgggaca 10680 
tggattectc cggtgaatag gcagagttgg aaactaaaca aatgttggtc ttgtgatctg 10740 
tgaaattgct ttcaagtgat agttaaagcc catgagatac agaacaaagc tgctatfctcg 10800 
aggtctcttg gtttatactc agaagcactc ctttgggttt ccctgcacta tcctgatcat 10860 
gtgctaggcc taccttaggc tgattgttgt tcaaataaac ttaagtttcc tgtcaggtga 10920 
tgfccatatga tttcatatat caaggcaaaa catgttatat atgtfcaaaca tttgtactta 10980 
atgtgaaagt taggtctttg tgggtttgat ttttaatttt caaaacctga gctaaataag 11040 ' 
tcatttttac atgtcttaca tttggtggaa ttgtataatt gtggtttgca ggcaagacfcc 11100 
tctgacctag taaccctacc tatagagcac tttgctgggt cacaagtcta ggagtcaagc 11160 
atttcacctt gaagttgaga cgttttgtta gtgtatacta gtttatatgc tggaggacat 11220 
gtttatccag aagatattca ggactatttt tgactgggct aaggaattga ttctgattag 11280 
cactgctagt gagcattgag tggcctttag gcttgaattg gagtcacttg tatatctcaa 11340 
ataatgctgg ccttttttaa aaagcccttg tfcctctatca ccctgttttc tacataatct 11400 
ttgttcaaag aaatacttgt ttggatcecc ttttgacaac aatagcatgt tttcaa'gcca 11460 
tattttttrt cctttttttt tttttttfctg gttttccgag acagggtttc tctgtatagc 11520 
QQtggctgtc ctggaactca ctttgtagac caggcfcggec tcgaactoag aaatccgcct 11580 
gcctctgcct cctgagtgcc gggattaaag gcgtgcacca ccacgcctgg ctaagttgga 11640 
tattttgtta tataacxata accaatacta actccacfcgg gtggactctt aattcagfcca 11700 
gtagtcttaa gtggtcttta ttggcccttc atnaaaatct actgttcacfc ctaacagagg 11760 
ctgctggtac tagtggcact taagcaactt cctacggata tactagcaga ttaagggtca 11820 
gggatagaaa ctagtctagc gttttgtata cctaccagct: t^atactacc ttgttcfcgat 11880 
agaaatattt caggacatct agcacccaat ccgccctata gtgagtcgfca ttacaattca 11940 
ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgtxaccca acttaatcgc 12 000 
cttgcagcac accccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc 12060 
ccttcccaac agttgcgcag cctgaatggc gaatgggacg cgccctgtag cggcgcatta 12120 
agcgcggcgg gtgtggtggt tacgcgcagc gtgac'cgcta cacttgccag cgccccagcg 12180 
cccgctcctt tcgccttctt, cccttccttt ctcgccacgfc tcgccggcct tccccgtcaa 12240 
gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc 12300 
aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt 12360 
cgccctttga cgttggagtc cacgttcttt: aatagtggac tcttgttcca aactggaaca 12420 
acactcaacc ctatctcggt ctattctttt gatttataag ggatcttgcc gatttcggcc 12480 
tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta 12540 
acgcttacaa tfctaggfcggc actettcggg gaaatgtgcg cggaacccct atctgtttat 12600 
tttcctaaat acattcaaat acgtatccgc tcatgagaca acaaccctga taaatgcttc 12660 
aataatattg aaaaaggaag agtatgagta ttcaacattc ccgtgtcgcc ettatrtccct 12720 
tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 12780 
atgccgaaga tcagttgggt gcacgagtgg gtcacatcga actggatctc aacagcggta 12840 
agatccttga gagttttcgc cccgaagaao gttttccaat gatgagcacc ttcaaagttc 12900 
cgctacgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggccgccgca 12960 
tacacfcattc ccagaacgac tcggtcgagc actcaccagt cacagaaaag catctcacgg 13020 
atggcatgac agtaagagaa ttatgcagtg ctgccacaac catgagtgat aacactgcgg* 13 0 80 
ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 13140 
cgggggatca cgtaactcgc cttgatcgct gggaaccgga gctgaatgaa gccataccaa 13200 
acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaaccactaa 13260 
ctggcgaact actcacccta gcttcccggc aacaattaat agaatggatg gaggcggata 13320 
aagttgcagg accacttctg cgctcggccc ttccggctgg cnggttcatt gctgataaat 13380 
ctggagccgg tgagcgeggg cctcgcggta tcattgcagc actggggcca gatggtaagc 13440 
cctcccgtat cgtagttatc tacacgacgg ggagteaggc aactatggat gaacgaaata 13500 
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gacagatcgc tgagataggt gcctcacega ttaaaeat- c B 

actcatatae accccagatt gatttaaaac ttclSSISl 3 ' aact9tca 9accaagCtt 13560 
agatcctttt tgataacccc ItgacSaaaa t^StaacS ESSE" ? tct ««*0« 13620 
cgccagaccc cgtagaaaag atlaaaggat ctt«S!S? £ gag ^ tcg ecc cactgag 13 680 
tctgctgcct gcaaacaaal aaaccaSgc £22«-£222E ^gcgcgtaa 13740 
agctaccaac tctctttccg aaggtaacta octtcf=?f« 99«tgettg ccggatcaag 13800 
tccttctagt gcagccgtaf ttSgc^cf KSlEf a3c H ca 9 ata ceaaatactg 13860 
acctcgctct gcelatcctl tcacfaScS Serf!? 8 * f tct 9 ta 9 c a ccgcccacat 13 920 
ccgggttgga ctcaagacgl tagt?aSS? 2E22S t9 9 c ? ataa 9 tcgtgtctta 1398? 
gttcgtgcac acagcecafc ttggagcgll cgatctac- ^I 3CC " 9C ****<*9S93 14040 
gtgagctang agaaagcglc acilttScS SSSSSSl of™'* 9 !? 3 taccCaca 9C 14100 
gcggcagggt cggaacagga gagcacaeal «a«f^E a 93cggacagg tacccggtaa 14160 
tttatagccc tgtcgggttt q^SSSS IafSSa«c« a ^ 93 ? aaac Scrctggcaec 1422? 
caggggggcg gagccEatgg alaaacgcca SaaS^ff? ~? aCbtCt 9 c S«tgctcgt 14280 
tctgctggcc ttttgctcll atgXct"? *£3S5g ttccc 99°« 14340 

gtattaccgc c ^scgccaec ccctgattcc gtggataacc 14400 

14411 
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Claims 

1. Method for generating transgenic eukaryotic cells having a modified Rosa 26 locus 
which method comprises the following step . 

(a) introducing a functional DNA sequence into the Rosa26 locus of startin 
eukaryotic cells by homologous recombination with a targeting vectc 
• comprising said functional DNA sequence flanked by DNA sequence 
homologous to the Rosa26 locus. 

2. The method of claim 1, wherein the eukaryotic cells 

(i) are derived from a multi-cell organism including vertebrates 
invertebrates and plants, preferably are vertebrate cells, more preferabl- 
are derived from a mammal, including rodents such as mouse, rat, etc. 
or a fish such as zebrafish; and/or 

(ii) are primary cells or immortalized cells; 

most preferably the cells are mammalian embryonic stem (ES) ceils. 

3. The method of claim 1 or 2, wherein 

(i) the functional DNA sequence is a gene expression cassette comprising a 

gene of interest operatively linked to a promoter or is a DNA sequence 
which can be converted into such gene expression cassette;and/or 

(11) the gene of interest is selected from recombinases, reporter genes, 

receptors, signaling molecules, transcription factors, pharmaceutically 
active proteins and peptides, drug target candidates, disease causing 
gene products, toxins, etc.; and/or 

(Hi) the promoter is a ubiquitous or tissue specific promoter, either 

constitutive or inducible, preferably is a CAGGS, hCMV, PGK, FABP, Lck, 
CamKII, CD19, Keratin, Albumin, aP2, Insulin, MCK, MyHC, WAP, Col2A, 
Mx, tet or Trex promoter; and/or 

(Iv) the DNA sequences homologous to the Rosa26 locus are 0.2 to 20 kB, 
preferably 1 to 10 kB long; and/or 



(v) the functional DNA sequence or gene expression cassette further 

comprises one ore more additional functional sequences including but not 
limited to marker genes, recombinase recognition sites, poly A signal 
introns, etc.; and/or ' 



(vi) 



the targeting vector further comprises tags for protein detection, 
enhancers, selection markers, etc. 



™^T Z ' Wh8re,n tranS9en ' C eUkarV0Bc ce " s are ""ived 

mouse, the DNA sequences homologous to *e Rosa 26 iocus are derived from the 5 

m 3 fl ,„ k ,ng arm of the mouse Rosa 26 iocus, preferabiy said homoiogous ONA 
sequences having the sequences shown in SEQ ID NO:4 and 5, respecdvely end the" 
promoter is a CAGGS-promoter, most preferably the targeting vector has the 
sequence shown In SEQ ID N0:7. 

s " ^"T" 9 to any one of daims 1 10 4 ' wh,ch further 

(b) isolating the eukaryotic cells, preferably the ES cells having the desl«d 
functional DNA sequence integrated into the Rosa26 .ocus ; and/or 
CO modifying the integrated funcdona. DNA sequence and isoladng ES ce.,s having 
the desired modified functional DNA sequence. 

6. A targeting vector as defined in claims 1 to 4. 

7. A eukaryotic eel, having a modified Rosa 2 6 locus obtainable by the method of claims 

8. A method for preparing transgenenic multi-ce.1 organism having a modified Rosa 26 
locus which comprises utilizing the method as defined in claims 1 to 5. 

9- The method of claim 8, wherein the transgenenic multi-cel, organism ls a non . 
12 ^ me ' h0d C ° mPriS6S m<>dif/in9 an K C6H 35 defl -" 
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10. The method of claim 9 which further comprises one or more of the steps 

(d) injecting ES cells obtained In steps (b) or (c) Into blastocysts; and/or 

(e) generating transgenic non-human animals carrying one or more functional gene 
of Interest at the Rosa26 locus. g 

11. A transgenic multi-cell organism and a transgenic non-human mammal obtainabl 
by the method of claim 8 and 9-10, respectively/ and having an operative! 
functional gene expression cassette integrated into its Rosa26 locus. 

12. Use of the eukaryotlc cell of claim 7, the transgenic multi-cell organism of clafrj 
11, or the transgenic non-human mammal of claim 11 for gene function studies 
drug development, as disease model animals, etc. 



27 

Abstract 



The .nvent.cn prov,des a method for targeted transgenesis usl n g the Rosa26 ,ocus 
Suable nuc.eot.de add sequences and vectors for the targeted transganes are 
proved The Rosa** -ocus proved to he a soltabie .ntegraoon L al.oJn" an d 
pred,ctabre expression of .nserted transganas casing exogenous promoted 
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